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GENETIC LOCI INDICATIVE OF PROPENSITY FOR LONGEVITY AND METHODS 
FOR IDENTIFYING PROPENSITY FOR AGE-RELATED DISEASE 

5 Related Applications 

[0001] This application is a continuation application of U.S. Application Serial No. 
09/928102, filed August 10, 2001, which claims the benefit of U.S. Application Nos. 60/224,643 
and 60/249,921, the disclosures of each of which is incorporated by reference herein. 

Government Support 

1 0 [0002] This invention was made with Government support under Grant No. AG1 872 1 -2 
awarded by the National Institutes of Health. The Government may have certain rights in this 
invention. 

Field of the Invention 

[0003] This invention relates to polymorphic markers indicative of the propensity for 
15 longevity and methods for using such markers, and more particularly to genetic loci on a 
specified region of human chromosome 4 associated with exceptional longevity. 

Background of the Invention 

[0004] It has long been suspected that a genetic predisposition exists for longevity in 
humans. For example, many studies have documented the inheritance of human longevity. 

20 McGue et al.,J. Gerontol. Biol. ScL, 48: B237-44 (1993); Ljungquist et al 9 1 Gerontol. Biol 

Set, 53: M441-46 (1998). Studies have also documented that centenarians (individuals who live 
for 100 years or more) are more likely than non-centenarians to have siblings who are long-lived. 
In particular, one study has shown that the siblings of centenarians have an approximately four- 
fold greater probability of survival to age 91 than siblings of non-centenarians. Perls et ai, 

25 Lancet, 351: 560 (1998). A more recent study has shown that female siblings of centenarians 
have over an eight-fold greater probability of survival to age 100 than siblings of non- 
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centenarians and male siblings of centenarians have over an eleven-fold greater probability of 
survival to age 97 than siblings of non-centenarians. 

[0005] Individuals who achieve exceptional longevity, such as centenarians, tend to live the 
majority of their lives in excellent health, demonstrating a rapid decline only at the end of their 
5 lives. Hitt et al, Lancet, 354: 652 (1999). A relative lack of polymorphic variants associated 
with diseases of aging may be one prerequisite to achieving exceptional longevity. For 
example, the absence of genetic polymorphisms among centenarians is exemplified by the rarity 
of the apolipoprotein E e-4 allele that has been associated with Alzheimer's disease and 
cardiovascular disease. Schachter et al, Nat. Genet., 6: 29-32 (1994); Rebeck et al, Neurology, 

10 44: 1513-16 (1994). Another prerequisite to achieving exceptional longevity may be the ability 
to modulate the rate of the aging process, which also appears to have a genetic component. For 
example, one study has shown that the offspring of centenarians had more favorable lipid profile 
characteristics compared to ethnically matched and unmatched controls. Barzilai et al, J. Am. 
Geriatr. Soc, 49: 76-79 (2001). These and other observations indicate that there may be one or 

1 5 more genetic loci that influence longevity. 

[0006] Genetic studies in other species including mammals indicate that specific genetic 
polymorphisms have powerful influences upon life span (defined by the age of the oldest 
member of the species). A number of studies on non-human species indicate that a relatively 
few genetic polymorphisms have a powerful influence upon the ability to achieve exceptional 
20 longevity. Many of those polymorphisms appear to play roles in basic mechanisms of 
metabolism and aging. 

[0007] Only recently have researchers begun to investigate a genetic link to exceptional 
longevity. One approach to determining the significance of genes having an influence on 
longevity is to screen for polymorphisms in human homologs and to determine the allelic 
25 frequencies among specific human phenotypes, such as centenarians, and compare them to 

ethnically matched younger controls. Alternatively, case-control studies can be performed on a 
/?rzor/-selected candidate genes chosen because of their hypothesized roles in fundamental 
mechanisms of aging. These basic mechanisms might include modulation of genomic instability, 
for example by DNA repair and anti-oxidant defenses, gene expression, cell proliferation and 
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senescence, maintenance of differential function, and signal transduction. The drawbacks of 
such studies include the improbability of picking the right gene to study out of the myriad known 
and unknown genes effecting the process of interest. 

[0008] Knowledge of longevity-associated polymorphisms would not only provide the 
benefit of predicting individual longevity, but would also provide the ability to predict the 
likelihood of age-associated diseases. Making such predictions would allow early prophylaxis 
that would reduce the severity of or would eliminate the occurrence of such diseases. 
Accordingly, there is a need in the art for genetic loci associated with longevity and methods of 
using polymorphic markers indicative of the propensity for longevity. Such genetic loci and 
methods are provided herein. 

Summary of the Invention 

[0009] This invention provides a genetic locus associated with extreme longevity. 
According to the invention, a specified region of human chromosome 4 is linked to the 
propensity for old age. In particular, the invention provides a longevity locus with a linkage at 
15 the D4S1564 marker (also referred to as AFM248zg9, 248zg9, and Z23817) on human 

chromosome 4. According to the invention, the presence of a familial linkage to the D4S1564 
marker on human chromosome 4 is indicative of a polymorphic variant associated with increased 
likelihood for longevity. Detection of an inherited variant at the D4S1564 marker, or a 
polymorphism within the longevity locus containing the D4S1564 marker, is indicative of the 
20 propensity for extreme old age. 

[0010] In a preferred embodiment, the invention provides a polymorphic marker contained in 
an approximately 10-20 cM region surrounding the D4S1564 marker. In a further embodiment, 
the invention provides a marker within 20 cM of position 1 12.6 on human chromosome 4. Also 
in a preferred embodiment, the invention comprises methods for detecting a longevity marker in 
25 a biological sample. Preferably, such methods comprise amplifying DNA in the region of human 
chromosome 4 comprising the D4S1564 marker. The amplification product comprises a region 
of human chromosome 4 that contains the locus associated with longevity. The common 
presence of a variant of the D4S1564 marker among related individuals, at least one of whom 



has lived to old age, is evidence of the presence of a polymorphic variant associated with the 
propensity for old age. 

[0011] Inherited variants of the D4S1564 marker in association with extreme longevity is 
indicative of the propensity for old age and the likelihood of avoiding disease of old age. For 
example, the invention provides the ability to predict the propensity for diseases such as heart 
disease, cardiovascular disease, stroke, Alzheimer's disease, cancer, ocular disease, and 
numerous others associated with the aging process. Accordingly, methods of the invention are 
useful not only to predict the likelihood of longevity, but are also useful to indicate possible early 
therapeutic intervention to prevent or to lessen the effects of diseases associated with aging. 
Further details of the use of the invention are provided below in the detailed description thereof. 

Description of the Drawings 

[0012] The invention is pointed out with particularity in the appended claims. The 
advantages of the invention described above, as well as further advantages of the invention, may 
be better understood by reference to the following detailed description taken in conjunction with 
the accompanying drawings, in which: 

[0013] Fig. 1 shows multipoint LOD scores for a genome scan on human chromosome 1 . 
[0014] Fig. 2 shows multipoint LOD scores for a genome scan on human chromosome 2. 
[0015] Fig. 3 shows multipoint LOD scores for a genome scan on human chromosome 3. 
[0016] Fig. 4 shows multipoint LOD scores for a genome scan on human chromosome 4. 
[0017] Fig. 5 shows multipoint LOD scores for a genome scan on human chromosome 5. 
[0018] Fig. 6 shows multipoint LOD scores for a genome scan on human chromosome 6. 
[0019] Fig. 7 shows multipoint LOD scores for a genome scan on human chromosome 7. 
[0020] Fig. 8 shows multipoint LOD scores for a genome scan on human chromosome 8. 
[0021] Fig. 9 shows multipoint LOD scores for a genome scan on human chromosome 9. 
[0022] Fig. 10 shows multipoint LOD scores for a genome scan on human chromosome 10. 
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[0036] Fig. 24 shows Kaplan-Meier curves depicting the probability of surviving to each age 
(conditional on surviving to at least age 20) of male and female siblings (n=2,092) of 
centenarians (n=444) to males and females from the same birth cohort (data from the U.S. Social 
Security Administration's 1900 cohort life table). 

Detailed Description of the Invention 

(0037] Compositions and methods of the invention are useful for predicting the propensity 
for exceptional longevity in humans. Additionally, compositions and methods of the invention 
are useful for predicting the propensity for age-related diseases including, but not limited to heart 
disease, cardiovascular disease, stroke, Alzheimer's disease, cancer, and ocular disease. 
Additionally, compositions and methods of the invention are useful for indicating possible early 
therapeutic intervention to prevent or to lessen the effects of diseases associated with aging. 



[0038] The present invention provides a polymorphic locus with significant influence upon 
longevity in humans. In particular, the invention provides a longevity locus with a linkage to the 
D4S1564 marker on human chromosome 4. The D4S1564 marker is also referred to as 
AFM248zg9, 248zg9, and Z23817. The D4S1564 marker is a DNA segment that is 
5 approximately 331 base pairs in length and contains a C-A repeat. Gyapay et aL 9 Nat. Genet, 
7(2): 246-339 (1994), which is incorporated by reference herein. The D4S1564 marker has 12 
alleles with an allele size range of 220-242 base pairs and is located at approximately the 1 12.6 
cM position on human chromosome 4. The D4S1564 marker is available on numerous Internet 
databases (see, e.g., Genbank, http://www.ncbi.nlm.nih.gov/; Genethon, 
10 http://www.genethon.fr/eng/indeng.html; Stanford Genome Center, http://www- 

shgc.stanford.edu/; Whitehead Institute Genome Center, http://www-genome.wi.mit.edu/; each 
of which is incorporated by reference herein). 

[0039] The following examples provide further details of the polymorphic markers and 
methods according to the invention. Accordingly, while exemplified in the following manner, 
1 5 the invention is not so limited and the skilled artisan will appreciate its wide range of application 
upon consideration thereof. 

EXAMPLE 1 

Linkage to Exceptional Longevity on Human Chromosome 4 at Marker D4S1564 

20 [0040] To investigate genetic markers that confer a substantial survival advantage in humans, 
genome- wide scans were performed on 308 individuals belonging to 137 sibships in which at 
least one sibling was 98 years old and at least one other sibling was at least 95 years old (female) 
or 91 years old (male). The genome- wide scans were performed using a linkage mapping set 
obtained from Applied Biosystems (Foster City, CA). DNA samples from each individual were 

25 initially genotyped with 400 tandem repeat markers with an average heterozygosity of 0.70 and 
an average marker density of 10 centiMorgans (cM). Size standards and alleles were determined 
using an allele calling program. 

[0041] Loci suggestive of linkage were genotyped with additional markers. Sibships in 
which the observed genotype data was 10 5 -fold more likely to have arisen if the individuals were 
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unrelated or half-siblings were excluded from the genetic analysis. In addition, those families 
with aberrantly high numbers of Mendelian inheritance errors were excluded. In suggestively- 
linked regions with higher density mapping, genotypes that implied double-recombinants were 
identified and the raw data were reexamined or retyped. 

5 [0042] An affecteds-only multipoint non-parametric analysis was performed. This analysis 
tested for excess sharing of chromosomal regions that were identical by descent in the sibships. 
The non-parametric analysis (MLS) was performed using the GENEHUNTER 2.0 program 
(Falling Rain Genomics, Inc., Lincoln, MA). 

[0043] Figs. 1-23 show multipoint LOD scores for a genome-wide scan of 308 individuals 
10 belonging to 137 families. The multipoint LOD scores in Figs. 1-23 are plotted as a function of 
specific markers on each chromosome. The horizontal threshold lines on each graph represent a 
MLS score of 2.0, a score slightly higher than the average maximum score expected by chance 
once in a genome scan. 

[0044] The genome screen of the 308 individuals identified a region on chromosome 4 as 
15 highly suggestive of linkage (MLS=2.67), as shown in Fig. 4. Fine mapping at an average of 1 
marker every 3 cM was performed in a 20 cM region around this peak resulting in an increased 
MLS=3.65 at marker D4S1564, as shown in Table 1 below. A drop off of 1.5 in the MLS score 
on either side of the peak MLS defines with 95% confidence the area in which the gene resides. 
A drop off in the MLS of 2 on either side of the peak is observed in a 20 cM region encompassed 
20 by D4S414 and D4S161 1. Table 1 below shows the MLS scores in this region and the HLod 
scores under the dominant model : 



TABLE 1 



Marker 


Position 


MLS 


Hloddom 


D4S1534 


95.0 


0.57 


0.54 


D4S414 


100.8 


1.53 


1.30 


D4S2986 


105.3 


2.78 


2.26 


D4ST572 


108.0 


3.07 


2.57 


D4S411 


109.0 


3.07 


2.60 


D4S1564 


112.6 


3.65 


3.26 


D4S406 


117.1 


2.55 


2.15 


D4S1611 


121.6 


1.70 


1.56 
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D4S402 


124.5 


1.39 


1.06 


D4S2975 


126.7 


0.94 


0.89 



[0045] In order to estimate the significance of the observation of linkage to chromosome 4, 
simulations under the hypothesis of no linkage were designed which match the family structure 
and marker heterozygosity as described above. One thousand genome scans were simulated with 
5 a genome-wide map density of 1 marker every three 3 cM (matching the fine mapping density 
performed in the region of maximum linkage) and in only 44 of those 1000 genome scans was 
the observed MLS of 3.65 exceeded (p=0.044). 

[0046] Previous studies indicate that parametric linkage analyses using both a dominant and 
recessive inheritance model may be more powerful in some cases than non-parametric studies for 
10 detecting genes contributing to complex traits. In addition, non-parametric methods do not 
easily allow for the presence of locus heterogeneity. Therefore, multipoint genome- wide 
affecteds-only parametric linkage analyses were conducted under both a dominant and recessive 
model with reduced-penetrance and allowing for heterogeneity. 

[0047] The finely mapped region of chromosome 4 provided the strongest results, with a 
1 5 maximum HLOD of 3.3 under the dominant model (with 45% of families linked) and a 

maximum HLOD of 2.9 under the recessive model (with 57% of families linked), both occurring 
at marker D4S1564. Empiric p-values were determined by genome-wide simulations under the 
dominant model. Of 1000 replicates, 30 had a LOD score greater than 3.3 under the dominant 
model (p=0.03) and 78 had a LOD score greater than 2.9 under the recessive model (p=0.08). A 
20 LOD score of 3.0 or higher is indicative of linkage. Because two models were evaluated, these 
parametric results are appropriately summarized and corrected by using the more significant 
result by multiplying the associated p-value by 2, thus producing a final parametric p-value of 
0.06. For this dataset and trait, the parametric analyses provide additional support for the 
findings of the non-parametric analyses, but do not provide greater statistical significance. 

25 [0048] These analyses provide significant evidence for heterogeneity, with odds in favor of 
heterogeneity of 1 800: 1 . The significant linkage based upon a sibling-pair study of modest 
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sample size indicates that there is at least one gene of significant importance in conferring 
exceptional longevity. 



EXAMPLE 2 

5 Propensity for Siblings of Centenarians of Achieving Exceptional Longevity 

[0049] To investigate the propensity of siblings achieving exceptional longevity, pedigree 
data from the families of centenarians was compiled and analyzed. The pedigrees of 444 
centenarian subjects revealed 2,092 siblings. Ages of death or for those still alive, ages when 
last censored, were tabulated. If there were multiple centenarians in a family, the oldest living 
10 centenarian was considered the proband. There were 356 female probands (80% of the 

centenarian subjects) with a median age of 106 years with a 95% confidence interval (CI) from 
105 to 107 years. There were 88 male probands (20% of the centenarian subjects) with a median 
age of 107 years with a 95% confidence interval (CI) from 104 to 108 years. The probands had a 
median of 5 siblings with a range of 1-15 siblings. 

15 [0050] The survival experience of the siblings of centenarians, segregated by gender, was 
estimated according the Kaplan-Meier product-limit method. Kaplan et. aL,J. Am. Statist 
Assoc., 53: 457-481 (1958). Standard errors of the survival probabilities were calculated using 
Greenwood's formula. Kalbfleisch et al., Statistical Analysis of Failure Time Data (1980). 
Standard errors were calculated assuming binomial probabilities at each age interval and using 

20 the number of centenarian siblings at risk of surviving each age interval as the effective sample 
size for that interval. The estimated median survival of the 1,075 female siblings (51% of the 
sample) was 86 years with a 95% confidence interval (CI) from 85 to 88 years. The estimated 
median survival of the 1,017 male siblings (49% of the sample) was 78 years with a 95% 
confidence interval (CI) from 76 to 80 years. Fig. 24 shows Kaplan-Meier curves for both 

25 gender groups among the individuals who achieved at least the age of twenty years. 

[0051] The propensity (X s ) of siblings surviving to specific ages was estimated for 
centenarian siblings relative to the U.S. Social Security Administration's cohort life table of 
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100,000 individuals from the year. The sibling propensity (X s ) was defined as the probability 
that a centenarian sibling survived to age x given that the sibling survived to age 20, divided by 
the probability that an individual born in 1900 survived to age x, given that the individual 
survived to age 20, where x is greater than 21 years of age. Pointwise 95% confidence intervals 
5 (CI) for sibling propenisty (X s ) was calculated according to the natural log scale and 

exponentiated. Data analysis was performed using the SAS v8.0 program (SAS Institute Inc., 
Cary, NC). 

[0052] Table 2 below compares the two groups for those individuals having achieved at least 
the age of 20 years, wherein the sibling propensity (X s ) according to age and gender is shown. 
10 The sibling propensity (X s ) determination was limited to age 97 for males and age 100 for 

females due to sample size. Beyond these ages, the lower 95% confidence interval (CI) dropped 
below 1. 

Table 2 



Survival 
to Age 
(yrs) 


A* Male 


Lower 
95% CI 


Upper 
95% CI 


Female 


Lower 
95% CI 


Upper 
95% CI 


21 


1.00 


0.95 


1.07 


1.00 


0.95 


1.06 


30 


1.01 


0.95 


1.07 


1.02 


0.96 


1.08 


35 


1.01 


0.95 


1.08 


1.03 


0.97 


1.09 


40 


1.03 


0.96 


1.10 


1.04 


0.98 


1.11 


45 


1.04 


0.97 


1.12 


1.06 


0.99 


1.13 


50 


1.08 


1.00 


1.16 


1.07 


1.01 


1.15 


55 


1.11 


1.02 


1.20 


1.09 


1.02 


1.17 


60 


1.18 


1.09 


1.28 


1.12 


1.04 


1.20 


65 


1.29 


1.17 


1.42 


1.16 


1.08 


1.25 


70 


1.48 


1.32 


1.65 


1.24 


1.14 


1.35 


75 


1.69 


1.46 


1.95 


1.36 


1.24 


1.49 


80 


2.03 


1.67 


2.47 


1.54 


1.37 


1.72 


85 


2.70 


2.00 


3.65 


1.84 


1.58 


2.14 


90 


4.09 


2.31 


7.23 


2.57 


2.03 


3.25 


91 


4.55 


2.33 


8.90 


2.77 


2.13 


3.61 


92 


5.36 


2.40 


11.97 


3.09 


2.28 


4.18 


93 


5.36 


2.40 


11.97 


3.45 


2.44 


4.89 


94 


5.36 


2.40 


11.97 


3.74 


2.48 


5.65 


95 


8.37 


1.92 


36.37 


4.28 


2.59 


7.09 
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96 


9.67 


1.51 


61.82 


4.76 


2.56 


8.88 


97 


11.31 


1.02 


125.4 


5.47 


2.48 


12.09 


98 








6.77 


2.42 


18.94 


99 








8.01 


2.11 


30.33 


100 








8.64 


1.32 


56.48 



[0053] The survival curves remained approximately the same for both groups through young 
adulthood (k s ~ 1). As shown in both Fig. 24 and Table 2, the survival experiences of the 
siblings of centenarians compared to the survival of the 1900 birth cohort began to diverge at 
5 ages 40 to 50 and became increasingly divergent at age 90 and beyond. For example, the sibling 
propensity (k s ) for men by age 97 was 11.3 and the sibling propensity (X s ) for women by age 100 
was 8.6. Thus, compared to the general population, the siblings of centenarians appear to have a 
markedly increased propensity for achieving exceptional longevity, suggesting a significant 
genetic contribution. 

10 [0054] While the invention has been shown and described with reference to specific 

preferred embodiments, it should be understood by those skilled in the art that various changes in 
form and detail may be made therein without departing from the spirit and scope of the invention 
as defined by the following claims. 



